Wideband coding of speech using neural network gain adaptation

نویسندگان

  • Cheung-Fat Chan
  • Man-Tak Chu
چکیده

In this paper, a high-quality wideband speech coder is proposed. The coding structure resembles a LD-CELP coder, however, several novel improvements are made. The gain adapter for the stochastic codebook is driven by a neural network and it updates the excitation gain in a sample-by-sample fashion. The purpose of incorporating a neural network is to exploit both the intraand inter-frame correlation of speech signal in a non-linear manner. A psychoacoustic model instead of a simple perceptual weighting filter is used to shape the quantization noise. Simulation result shows that the proposed coder can achieve transparent coding of wideband speech at 16 kbps.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in LD-CELP speech coding algorithm is predicted using three neural models, that are equipped by genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of neural networks. Elman, multi-layer perceptron (MLP) and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

متن کامل

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in LD-CELP speech coding algorithm is predicted using three neural models, that are equipped by genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of neural networks. Elman, multi-layer perceptron (MLP) and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

متن کامل

Improving wideband acoustic models using mixed-bandwidth training data via DNN adaptation

In the past few years, deep neural networks (DNNs) have achieved great successes in speech recognition. The deep network model can be viewed as a series of feature transforms followed by a log-linear classifier. For input of speeches from different bandwidths, although the hidden layer transform and log-linear classification can be shared, the input layer transforms should be specially designed...

متن کامل

Sensitivity Analysis of a Wideband Backward-wave Directional Coupler Using Neural Network and Monte Carlo Method (RESEARCH NOTE)

In this paper sensitivity analysis of a wideband backward-wave directional coupler due to fabrication imperfections is done using Monte Carlo method. For using this method, a random stochastic process with Gaussian distribution by 0 average and 0.1 standard deviation is added to the different geometrical parameters of the coupler and the frequency response of the coupler is estimated. The appli...

متن کامل

Complexity Reduction of LD-CELP Speech Coding in Prediction of Gain Using Neural Networks

Reducing the computational complexity is desired in speech coding algorithms. In this paper, three neural gain predictors are proposed which can function as backward gain adaptation module of low delay-code excited linear prediction (LD-CELP) G.728 encoder, recommended by International Telecommunication Union-Telecom sector (ITU-T, formerly CCITT). Elman, multilayer perceptron (MLP) and fuzzy A...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997